Designing a multimodal dialogue system for information retrieval
Authors
Abstract
This paper introduces a paradigm for designing multimodal dialogue systems. An example task of the system is to retrieve particular information about different shops in the Tokyo Metropolitan area, such as their names, addresses and phone numbers. The system accepts speech and screen touching as input, and presents retrieved information on a screen display. The speech recognition part is modeled by an FSN (finite state network) consisting of keywords and fillers, both of which are implemented by a DAWG (directed acyclic word-graph) structure. There are 306 keywords, consisting of district names and business names. The fillers accept roughly 100,000 non-keywords/phrases occurring in spontaneous speech. A variety of dialogue strategies are designed and evaluated based on an objective cost function having a set of actions and states as parameters. Expected dialogue cost is calculated for each strategy, and the best strategy is selected according to the keyword recognition accuracy.
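The idea of selecting a strategy by expected dialogue cost can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the two strategies (`no_confirm`, `confirm`), the cost constants, and the retry model are hypothetical. The sketch assumes each attempt succeeds independently with probability `p` (the keyword recognition accuracy) and that the user retries until the keyword is recognized correctly.

```python
# Hypothetical per-action costs in arbitrary units (not from the paper).
COST_SPEAK = 1.0         # user utters a keyword
COST_CONFIRM = 0.5       # system asks for confirmation before retrieving
COST_WRONG_RESULT = 3.0  # user is shown results for a misrecognized keyword


def expected_cost_no_confirm(p: float) -> float:
    """Expected cost when the system retrieves immediately.

    Solves E = speak + (1 - p) * (wrong_result + E) for E.
    """
    return (COST_SPEAK + (1.0 - p) * COST_WRONG_RESULT) / p


def expected_cost_confirm(p: float) -> float:
    """Expected cost when the system confirms every keyword first.

    Solves E = speak + confirm + (1 - p) * E for E.
    """
    return (COST_SPEAK + COST_CONFIRM) / p


# Hypothetical strategy table: name -> expected-cost function of accuracy p.
STRATEGIES = {
    "no_confirm": expected_cost_no_confirm,
    "confirm": expected_cost_confirm,
}


def best_strategy(p: float) -> str:
    """Return the name of the strategy with the lowest expected cost."""
    if not 0.0 < p <= 1.0:
        raise ValueError("recognition accuracy must be in (0, 1]")
    return min(STRATEGIES, key=lambda name: STRATEGIES[name](p))


if __name__ == "__main__":
    for accuracy in (0.70, 0.95):
        print(accuracy, best_strategy(accuracy))
```

Under these assumed costs, skipping confirmation is cheaper only when `(1 - p) * COST_WRONG_RESULT < COST_CONFIRM`, so the preferred strategy switches as recognition accuracy changes. The paper's actual cost function is defined over a set of actions and states and may differ substantially from this toy model.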
Similar resources
Natural Language Dialogue System for Information Retrieval
The objective of our work is the development of a natural language dialogue system for information retrieval with multimodal input and multimedia output. Overall, the system consists of three phases: input analysis, information and knowledge management and output generation. The dialogue system is designed for consulting old Mexican historical documents. In this paper we describe the designed a...
A Testbed for Evaluating Multimodal Dialogue Systems for Small Screen Devices
This paper discusses the requirements for developing a multimodal spoken dialogue system for mobile phone applications. Since visual output as part of the multimodal system is limited by the restricted screen size of mobile phones, research in the field of information visualisation for small screen devices is discussed and combinations of these techniques with spoken output are sketched. ...
Principles and design of an intelligent system for information retrieval over the internet with a multimodal dialogue interface
In the information society of the next millennium, information retrieval over the Internet will be indispensable for everyday life, and spoken language will be an essential medium for human-machine dialogue. This paper presents an overview of an intelligent system for information retrieval based on spoken dialogue, use of key concepts, processing of unknown words, knowledge acquisition, and agen...
Public Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
DORIS, a multiagent/IP platform for multimodal dialogue applications
This article presents an effort to define a multimodal agent-based dialogue platform cooperating with Internet technologies. We propose an open architecture integrating voice-based and graphical user interfaces. This work is illustrated by a demonstrator, GEORAL, for the tourist information retrieval domain.